Finding Regressions in Projects under Version Control Systems

نویسندگان

  • Jaroslav Bendík
  • Nikola Benes
  • Ivana Cerná
چکیده

Version Control Systems (VCS) are frequently used to support development of large-scale software projects. A typical VCS repository of a large project can contain various intertwined branches consisting of a large number of commits. If some kind of unwanted behaviour (e.g. a bug in the code) is found in the project, it is desirable to find the commit that introduced it. Such commit is called a regression point. There are two main issues regarding the regression points. First, detecting whether the project after a certain commit is correct can be very expensive as it may include large-scale testing and/or some other forms of verification. It is thus desirable to minimise the number of such queries. Second, there can be several regression points preceding the actual commit; perhaps a bug was introduced in a certain commit, inadvertently fixed several commits later, and then reintroduced in a yet later commit. In order to fix the actual commit it is usually desirable to find the latest regression point. The currently used distributed VCS contain methods for regression identification, see e.g. the git bisect tool. In this paper, we present a new regression identification algorithm that outperforms the current tools by decreasing the number of validity queries. At the same time, our algorithm tends to find the latest regression points which is a feature that is missing in the state-of-the-art algorithms. The paper provides an experimental evaluation of the proposed algorithm and compares it to the state-of-the-art tool git bisect on a real data set.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Version of Earned Value Analysis for Mega Projects Under Interval-valued Fuzzy Environment

The earned value technique is a crucial and important technique in analysis and control the performance and progress of mega projects by integrating three elements of them, i.e., time, cost and scope. This paper proposes a new version of earned value analysis (EVA) to handle uncertainty in mega projects under interval-valued fuzzy (IVF)-environment. Considering that uncertainty is very common i...

متن کامل

A Systems Dynamics Model for Project Management systems of Project-Based Organization

It is obvious that the success of a project-based organization is dependent on its projects. A variety of tools such as the project excellence model, project management maturity models, the earned value method, have been developed in this regard, but there are still delays in projects because the projects have dynamic nature with non-linear relationships and feedback processes during the projec...

متن کامل

Collaborative Topic Modeling for Recommending GitHub Repositories

The rise of distributed version control systems has led to a significant increase in the number of open source projects available online. As a consequence, finding relevant projects has become more difficult for programmers. Item recommendation provides a way to solve this problem. In this paper, we utilize a recently proposed algorithm that combines traditional collaborative filtering and prob...

متن کامل

Effects of control on the performance of information systems projects: The moderating role of complexity risk

Control of projects is a core issue for organizations. Successful projects, such as information systems projects, enable organizations to develop a superior supply network and enhance the capability of operations management. A few studies have investigated the effects of control on project performance; however, complexity risk has not been integrated into the relationship between control andper...

متن کامل

Performance Appraisal of Research and Development Projects Value-Chain for Complex Products and Systems: The Fuzzy Three-Stage DEA Approach

The purpose of the current research is to provide a performance appraisal system capable of considering the value chain network structure of research and development (R&D) projects for Complex products and systems (CoPS) under uncertainty of data. Therefore, in order to achieve this goal, a network data envelopment analysis (NDEA) approach and the possibilistic programming to provide a new fuzz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1708.06623  شماره 

صفحات  -

تاریخ انتشار 2017